Notebook

Numerical operations with Numpy¶

3.1 Broadcasting¶

3.2 Array shape manipulation¶

3.3 Sorting data¶

Summary¶

Exercises¶

In [1]:

importnumpyasnpimportmatplotlib.pyplotasplt%matplotlib inline 

3.1 Broadcasting Operations¶

We just covered basic operations (add, multiple, square etc) such are element-wise but that works on arrays of same size
Broadcasting comes handy when we are dealing with different shapes. This time, we'll explore a more advanced concept in numpy called broadcasting.
The term broadcasting describes how numpy treats arrays with different shapes during arithmetic operations. Subject to certain constraints, the smaller array is "broadcast" across the larger array so that they have compatible shapes.
Broadcasting provides a means of vectorizing array operations so that looping occurs in C instead of Python. It does this without making needless copies of data and usually leads to efficient algorithm implementations. There are also cases where broadcasting is a bad idea because it leads to inefficient use of memory that slows computation.
In this little tutorial we will provide a gentle introduction to broadcasting with numerous examples ranging from simple to involved.
We will also go through a few examples of when to and when not to use boradcasting.

This example below shows how broadcasting works¶

So, lets start taking baby steps...¶

Here an element-wise multiplication occurs since the two arrays are of same shape

In [2]:

e=np.array([1.0,2.0,3.0])f=np.array([2.0,2.0,2.0])e*f

Out[2]:

array([2., 4., 6.])

Hint / Try it?¶

What would have happened if f = np.array([2.0, 2.0]). would it still multiply?

In [3]:

# But if it was like thise=np.array([1.0,2.0,3.0])f=2.0e*f

Out[3]:

array([2., 4., 6.])

What happened here¶

This is the most simplest example on numpy broadcasting where an array and a scalar values were combined in an operation.

so it kind of stechted in the row direction! The scalar f is stretched to become an array of with the same shape as e so the shapes are compatible for element-by-element multiplication.

** So what are the rules then?**

They must either be equal / same shape

One of them must be 1, like f was above

In [4]:

# Typical broadcasting in practiceg=np.array([[0.0,0.0,0.0],[10.0,10.0,10.0],[20.0,20.0,20.0],[30.0,30.0,30.0]])g

Out[4]:

array([[ 0., 0., 0.], [10., 10., 10.], [20., 20., 20.], [30., 30., 30.]])

In [5]:

h=np.array([1.0,2.0,3.0])h

Out[5]:

array([1., 2., 3.])

In [6]:

g+h

Out[6]:

array([[ 1., 2., 3.], [11., 12., 13.], [21., 22., 23.], [31., 32., 33.]])

What happened above?¶

A 2-D (two-dimensional) array multiplied by 1-D (one-dimensional) array. It got stretched in the column direction so as to match the elements of the 2D array columns.

Would the same be possible for different shapes? Does broadcasting magically understands and fixes our assumptions?

Let's take a look...

In [7]:

g=np.array([[0.0,0.0,0.0],[10.0,10.0,10.0],[20.0,20.0,20.0],[30.0,30.0,30.0]])i=np.array([0.0,1.0,2.0,3.0])g+i

---------------------------------------------------------------------------ValueError Traceback (most recent call last) <ipython-input-7-3eb92f04cdb1> in <module> 1 g = np.array([[0.0,0.0,0.0],[10.0,10.0,10.0],[20.0,20.0,20.0],[30.0,30.0,30.0]]) 2 i = np.array([0.0,1.0,2.0,3.0])----> 3g+i ValueError: operands could not be broadcast together with shapes (4,3) (4,)

We had a mismatch...¶

Explanation: When the trainling dimensions of the arrays are different as you saw above, then broadcasting will fail making it impossible to align the values in the rows of the first array with the elements of the second array for an element-by-element addition or multiplication.

Also, is there a way to do this in one line of code¶

Tip: look up more into np.tile and np.arange

In [8]:

a=np.tile(np.arange(0,40,10),(3,1))a=a.T# transpose thisa

Out[8]:

array([[ 0, 0, 0], [10, 10, 10], [20, 20, 20], [30, 30, 30]])

In [9]:

b=np.array([0,1,2])b

Out[9]:

array([0, 1, 2])

Now, we add these two¶

In [10]:

a+b

Out[10]:

array([[ 0, 1, 2], [10, 11, 12], [20, 21, 22], [30, 31, 32]])

So you see that broadcasting was applied magically...¶

Ask yourself, why couldn't we add original a and b ?

Note, original a was:

array([[0,10,20,30],[0,10,20,30],[0,10,20,30]])

In [11]:

c=np.ones((5,6))c

Out[11]:

array([[1., 1., 1., 1., 1., 1.], [1., 1., 1., 1., 1., 1.], [1., 1., 1., 1., 1., 1.], [1., 1., 1., 1., 1., 1.], [1., 1., 1., 1., 1., 1.]])

Let's assign an array of dimension 0 to an array of dimension 1¶

In [12]:

c[0]=2c

Out[12]:

array([[2., 2., 2., 2., 2., 2.], [1., 1., 1., 1., 1., 1.], [1., 1., 1., 1., 1., 1.], [1., 1., 1., 1., 1., 1.], [1., 1., 1., 1., 1., 1.]])

In [13]:

d=np.arange(0,30,10)d

Out[13]:

array([ 0, 10, 20])

In [14]:

d.shape

Out[14]:

(3,)

In [15]:

d=d[:,np.newaxis]# Here we add a new axis and make it a 2D arrayd.shape

Out[15]:

(3, 1)

In [16]:

a+d

---------------------------------------------------------------------------ValueError Traceback (most recent call last) <ipython-input-16-4fbab87c839c> in <module>----> 1a + d ValueError: operands could not be broadcast together with shapes (4,3) (3,1)

Another example on broadcasting¶

Let’s construct an array of distances (in miles) between cities of Route 66: Chicago, Springfield, Saint-Louis, Tulsa, Oklahoma City, Amarillo, Santa Fe, Albuquerque, Flagstaff and Los Angeles.

In [17]:

mileposts=np.array([0,198,303,736,871,1175,1475,1544,1913,2448])distance_array=np.abs(mileposts-mileposts[:,np.newaxis])distance_array

Out[17]:

array([[ 0, 198, 303, 736, 871, 1175, 1475, 1544, 1913, 2448], [ 198, 0, 105, 538, 673, 977, 1277, 1346, 1715, 2250], [ 303, 105, 0, 433, 568, 872, 1172, 1241, 1610, 2145], [ 736, 538, 433, 0, 135, 439, 739, 808, 1177, 1712], [ 871, 673, 568, 135, 0, 304, 604, 673, 1042, 1577], [1175, 977, 872, 439, 304, 0, 300, 369, 738, 1273], [1475, 1277, 1172, 739, 604, 300, 0, 69, 438, 973], [1544, 1346, 1241, 808, 673, 369, 69, 0, 369, 904], [1913, 1715, 1610, 1177, 1042, 738, 438, 369, 0, 535], [2448, 2250, 2145, 1712, 1577, 1273, 973, 904, 535, 0]])

Another example¶

A lot of grid-based or network-based problems can also use broadcasting. For instance, if we want to compute the distance from the origin of points on a 10x10 grid, we can do

In [18]:

x,y=np.arange(5),np.arange(5)[:,np.newaxis]distance=np.sqrt(x**2+y**2)distance

Out[18]:

array([[0. , 1. , 2. , 3. , 4. ], [1. , 1.41421356, 2.23606798, 3.16227766, 4.12310563], [2. , 2.23606798, 2.82842712, 3.60555128, 4.47213595], [3. , 3.16227766, 3.60555128, 4.24264069, 5. ], [4. , 4.12310563, 4.47213595, 5. , 5.65685425]])

Or in color...¶

In [19]:

plt.pcolor(distance)plt.colorbar

Out[19]:

<function matplotlib.pyplot.colorbar(mappable=None, cax=None, ax=None, **kw)>

In [20]:

# Note : The numpy.ogrid function allows to directly create vectors# x and y of the previous examplex,y=np.ogrid[0:5,0:5]x,y

Out[20]:

(array([[0], [1], [2], [3], [4]]), array([[0, 1, 2, 3, 4]]))

In [21]:

x.shape,y.shape

Out[21]:

((5, 1), (1, 5))

np.ogrid is quite useful as soon as we have to handle computations on a grid. While on other hand, np.mgrid directly provides matrices full of indices for cases where we can't or maybe don't want to benefit from broadcasting.

In [22]:

x,y=np.mgrid[0:4,0:4]x

Out[22]:

array([[0, 0, 0, 0], [1, 1, 1, 1], [2, 2, 2, 2], [3, 3, 3, 3]])

In [23]:

Out[23]:

array([[0, 1, 2, 3], [0, 1, 2, 3], [0, 1, 2, 3], [0, 1, 2, 3]])

A bit on Vector quantization or VQ¶

A simple way to understand bradcasting is with this real world situation. The basic operatio in VQ finds the closest point in a set of points, called $codes$ in VQ speak, to a given point, called the observation.

In the 2D example below, the value in an $observation$ describe the weight and height of an athlete to be classified. The $codes$ represent different classes of athletes such as dancer, runner, swimmer an so on.

Finding the closest point requires calculating the distance between observation and each of the codes.

The shortest distance provides the best match. Here in this example, codes[0] is the closest class indicating that the athlete is likely a basketball player.

In [24]:

fromnumpyimportarray,argmin,sqrt,sumobservation=array([111.0,188.0])codes=array([[102.0,203.0],[132.0,193.0],[45.0,155.0],[57.0,173.0]])

In [25]:

# This is how broadcast happensdifference=codes-observationdistance=sqrt(sum(difference**2,axis=-1))nearest=argmin(distance)nearest

Out[25]:

The basic operation of vector quantization calculates the distance between an object to be classified, the black square, and multiple known codes, the gray circles. In the very basic case, the codes represent classes.

A more advanced example¶

@article{scikit-learn, title={Scikit-learn: Machine Learning in {P}ython}, author={Pedregosa, F. and Varoquaux, G. and Gramfort, A. and Michel, V. and Thirion, B. and Grisel, O. and Blondel, M. and Prettenhofer, P. and Weiss, R. and Dubourg, V. and Vanderplas, J. and Passos, A. and Cournapeau, D. and Brucher, M. and Perrot, M. and Duchesnay, E.}, journal={Journal of Machine Learning Research}, volume={12}, pages={2825--2830}, year={2011} }

In [26]:

# A more complex exampleimportnumpyasnpimportscipyasspimportmatplotlib.pyplotaspltfromsklearnimportclustertry:# SciPy >= 0.16 have face in miscfromscipy.miscimportfaceface=face(gray=True)exceptImportError:face=sp.face(gray=True)n_clusters=5np.random.seed(0)X=face.reshape((-1,1))# We need an (n_sample, n_feature) arrayk_means=cluster.KMeans(n_clusters=n_clusters,n_init=4)k_means.fit(X)values=k_means.cluster_centers_.squeeze()labels=k_means.labels_# create an array from labels and valuesface_compressed=np.choose(labels,values)face_compressed.shape=face.shapevmin=face.min()vmax=face.max()# original faceplt.figure(1,figsize=(3,2.2))plt.imshow(face,cmap=plt.cm.gray,vmin=vmin,vmax=256)# compressed faceplt.figure(2,figsize=(3,2.2))plt.imshow(face_compressed,cmap=plt.cm.gray,vmin=vmin,vmax=vmax)# equal bins faceregular_values=np.linspace(0,256,n_clusters+1)regular_labels=np.searchsorted(regular_values,face)-1regular_values=.5*(regular_values[1:]+regular_values[:-1])# meanregular_face=np.choose(regular_labels.ravel(),regular_values,mode="clip")regular_face.shape=face.shapeplt.figure(3,figsize=(3,2.2))plt.imshow(regular_face,cmap=plt.cm.gray,vmin=vmin,vmax=vmax)# histogramplt.figure(4,figsize=(3,2.2))plt.clf()plt.axes([.01,.01,.98,.98])plt.hist(X,bins=256,color='.5',edgecolor='.5')plt.yticks(())plt.xticks(regular_values)values=np.sort(values)forcenter_1,center_2inzip(values[:-1],values[1:]):plt.axvline(.5*(center_1+center_2),color='b')forcenter_1,center_2inzip(regular_values[:-1],regular_values[1:]):plt.axvline(.5*(center_1+center_2),color='b',linestyle='--')plt.show()

3.2 Array Shape Manipulation¶

Flattening¶

In [27]:

a=np.array([[1,2,3],[4,5,6]])a.ravel()"""A 1-D array, containing the elements of the input, is returned. A copy is made only if needed. Do help(np.ravel) to learn more"""

Out[27]:

'\nA 1-D array, containing the elements of the input, is returned. A copy is\n made only if needed.\n Do help(np.ravel) to learn more\n'

In [28]:

a.T

Out[28]:

array([[1, 4], [2, 5], [3, 6]])

Reshaping¶

In [29]:

a.shape

Out[29]:

(2, 3)

In [30]:

a.reshape(-1)

Out[30]:

array([1, 2, 3, 4, 5, 6])

In [31]:

b=a.ravel()b

Out[31]:

array([1, 2, 3, 4, 5, 6])

In [32]:

b=b.reshape((2,3))b

Out[32]:

array([[1, 2, 3], [4, 5, 6]])

In [33]:

# Which is same as ...a.reshape(2,-1)

Out[33]:

array([[1, 2, 3], [4, 5, 6]])

In [34]:

# Note: ndarray.reshape may return a view (cf help(np.reshape))), or copyb[0,0]=99a

Out[34]:

array([[99, 2, 3], [ 4, 5, 6]])

In [35]:

# Reshape also returns a copy, take a looka=np.zeros((3,2))b=a.T.reshape(3*2)b[0]=9a

Out[35]:

array([[0., 0.], [0., 0.], [0., 0.]])

Memory layout of a numpy array¶

Here's a good example of how it works

In [36]:

x=np.random.rand(2,2)x.data

Out[36]:

<memory at 0x7faf84de2ee0>

In [37]:

x.__array_interface__['data']

Out[37]:

(140391821883968, False)

In [38]:

x[0].__array_interface__['data']

Out[38]:

(140391821883968, False)

In [39]:

x[0,:].__array_interface__['data']

Out[39]:

(140391821883968, False)

In [40]:

x[1,:].__array_interface__['data']

Out[40]:

(140391821883984, False)

In [41]:

x[0,0].__array_interface__['data']

Out[41]:

(140391740312704, False)

3.3 Sorting Data¶

Function

sort (arr, axis=-1, kind='quick', order=None)

Method

arr.sort (axis=-1, kind='quick', order=None)

In [42]:

# Sorting along an axis. see what happens?a=np.array([[1,4,3],[3,1,3]])b=np.sort(a,axis=1)print(b)

[[1 3 4] [1 3 3]]

In [43]:

# In-place sorta.sort(axis=1)print(a)

[[1 3 4] [1 3 3]]

In [44]:

# Sorting with fancy indexinga=np.array([5,4,6,1])x=np.argsort(a)x

Out[44]:

array([3, 1, 0, 2])

In [45]:

# Finding minima and maximab=np.array([3,5,2,6])b_max=np.argmax(b)b_min=np.argmin(b)print(b_max)print(b_min)

3 2

Some Exercises 😅¶

1. Array manipulations¶

Create this 2D array (without typing manually)

[[1, 7, 12], [2, 8, 13], [3, 9, 14], [4, 10, 15], [5, 11, 16]]

2.¶

Fun Exercises: Challenge questions¶

Try in-place, out_of_place sorting
Create arrays with different dtypes and sort them.
Use all or array_equal to see what it returns
Use np.random.shuffle to create a more sortable input
Combine ravel, sort and reshape in one
Look at the axis keyword for sort and rewrite the previous exercise

In [46]:

a=np.arange(25).reshape(5,5)a

Out[46]:

array([[ 0, 1, 2, 3, 4], [ 5, 6, 7, 8, 9], [10, 11, 12, 13, 14], [15, 16, 17, 18, 19], [20, 21, 22, 23, 24]])

In [47]:

help(np.sum)

Help on function sum in module numpy: sum(a, axis=None, dtype=None, out=None, keepdims=<no value>, initial=<no value>, where=<no value>) Sum of array elements over a given axis. Parameters ---------- a : array_like Elements to sum. axis : None or int or tuple of ints, optional Axis or axes along which a sum is performed. The default, axis=None, will sum all of the elements of the input array. If axis is negative it counts from the last to the first axis. .. versionadded:: 1.7.0 If axis is a tuple of ints, a sum is performed on all of the axes specified in the tuple instead of a single axis or all the axes as before. dtype : dtype, optional The type of the returned array and of the accumulator in which the elements are summed. The dtype of `a` is used by default unless `a` has an integer dtype of less precision than the default platform integer. In that case, if `a` is signed then the platform integer is used while if `a` is unsigned then an unsigned integer of the same precision as the platform integer is used. out : ndarray, optional Alternative output array in which to place the result. It must have the same shape as the expected output, but the type of the output values will be cast if necessary. keepdims : bool, optional If this is set to True, the axes which are reduced are left in the result as dimensions with size one. With this option, the result will broadcast correctly against the input array. If the default value is passed, then `keepdims` will not be passed through to the `sum` method of sub-classes of `ndarray`, however any non-default value will be. If the sub-class' method does not implement `keepdims` any exceptions will be raised. initial : scalar, optional Starting value for the sum. See `~numpy.ufunc.reduce` for details. .. versionadded:: 1.15.0 where : array_like of bool, optional Elements to include in the sum. See `~numpy.ufunc.reduce` for details. .. versionadded:: 1.17.0 Returns ------- sum_along_axis : ndarray An array with the same shape as `a`, with the specified axis removed. If `a` is a 0-d array, or if `axis` is None, a scalar is returned. If an output array is specified, a reference to `out` is returned. See Also -------- ndarray.sum : Equivalent method. add.reduce : Equivalent functionality of `add`. cumsum : Cumulative sum of array elements. trapz : Integration of array values using the composite trapezoidal rule. mean, average Notes ----- Arithmetic is modular when using integer types, and no error is raised on overflow. The sum of an empty array is the neutral element 0: >>> np.sum([]) 0.0 For floating point numbers the numerical precision of sum (and ``np.add.reduce``) is in general limited by directly adding each number individually to the result causing rounding errors in every step. However, often numpy will use a numerically better approach (partial pairwise summation) leading to improved precision in many use-cases. This improved precision is always provided when no ``axis`` is given. When ``axis`` is given, it will depend on which axis is summed. Technically, to provide the best speed possible, the improved precision is only used when the summation is along the fast axis in memory. Note that the exact precision may vary depending on other parameters. In contrast to NumPy, Python's ``math.fsum`` function uses a slower but more precise approach to summation. Especially when summing a large number of lower precision floating point numbers, such as ``float32``, numerical errors can become significant. In such cases it can be advisable to use `dtype="float64"` to use a higher precision for the output. Examples -------- >>> np.sum([0.5, 1.5]) 2.0 >>> np.sum([0.5, 0.7, 0.2, 1.5], dtype=np.int32) 1 >>> np.sum([[0, 1], [0, 5]]) 6 >>> np.sum([[0, 1], [0, 5]], axis=0) array([0, 6]) >>> np.sum([[0, 1], [0, 5]], axis=1) array([1, 5]) >>> np.sum([[0, 1], [np.nan, 5]], where=[False, True], axis=1) array([1., 5.]) If the accumulator is too small, overflow occurs: >>> np.ones(128, dtype=np.int8).sum(dtype=np.int8) -128 You can also start the sum with a value other than zero: >>> np.sum([10], initial=5) 15

In [48]:

help(np.matrix.sum)

Help on function sum in module numpy.matrixlib.defmatrix: sum(self, axis=None, dtype=None, out=None) Returns the sum of the matrix elements, along the given axis. Refer to `numpy.sum` for full documentation. See Also -------- numpy.sum Notes ----- This is the same as `ndarray.sum`, except that where an `ndarray` would be returned, a `matrix` object is returned instead. Examples -------- >>> x = np.matrix([[1, 2], [4, 3]]) >>> x.sum() 10 >>> x.sum(axis=1) matrix([[3], [7]]) >>> x.sum(axis=1, dtype='float') matrix([[3.], [7.]]) >>> out = np.zeros((2, 1), dtype='float') >>> x.sum(axis=1, dtype='float', out=np.asmatrix(out)) matrix([[3.], [7.]])

In [49]:

np.sum([1.0,1.5])

Out[49]:

2.5

In [50]:

np.sum([1.0,0.4,0.5,0.6],dtype=np.int32)

Out[50]:

In [51]:

np.sum([[0,2],[0,6]])

Out[51]:

In [52]:

np.sum([[0,2],[0,6]],axis=0)

Out[52]:

array([0, 8])

In [53]:

np.sum([[0,2],[0,6]],axis=1)

Out[53]:

array([2, 6])

In [ ]: